NSF PAR Search | NSF Public Access Repository

Community Detection with Heterogeneous Block Covariance Model

https://doi.org/10.1080/10618600.2025.2505725

Li, Xiang; Zhao, Yunpeng; Pan, Qing; Hao, Ning (July 2025, Journal of Computational and Graphical Statistics)

Free, publicly-accessible full text available July 3, 2026
Variational Estimators of the Degree-corrected Latent Block Model for Bipartite Networks

Zhao, Yunpeng; Hao, Ning; Zhu, Ji (May 2024, Journal of Machine Learning Research)

Bipartite graphs are ubiquitous across various scientific and engineering fields. Simultaneously grouping the two types of nodes in a bipartite graph via biclustering represents a fundamental challenge in network analysis for such graphs. The latent block model (LBM) is a commonly used model-based tool for biclustering. However, the effectiveness of the LBM is often limited by the influence of row and column sums in the data matrix. To address this limitation, we introduce the degree-corrected latent block model (DC-LBM), which accounts for the varying degrees in row and column clusters, significantly enhancing performance on real-world data sets and simulated data. We develop an efficient variational expectation-maximization algorithm by creating closed-form solutions for parameter estimates in the M steps. Furthermore, we prove the label consistency and the rate of convergence of the variational estimator under the DC-LBM, allowing the expected graph density to approach zero as long as the average expected degrees of rows and columns approach infinity when the size of the graph increases.
Full Text Available
A Note on the Identifiability of the Degree‐Corrected Stochastic Block Model

https://doi.org/10.1002/sta4.70067

Park, John; Zhao, Yunpeng; Hao, Ning (May 2025, Stat)

In this short note, we address the identifiability issues inherent in the degree‐corrected stochastic block model (DCSBM). We provide a rigorous proof demonstrating that the parameters of the DCSBM are identifiable up to a scaling factor and a permutation of the community labels, under a mild condition.
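
The scaling ambiguity mentioned in this abstract can be illustrated with a short calculation. This is only a sketch under the commonly used DCSBM parameterization, which is an assumption here and may differ from the one in the note: node $i$ has community label $c_i \in \{1, \dots, K\}$ and degree parameter $\theta_i > 0$, and $B$ is a $K \times K$ block matrix. The constants $a_k$ are hypothetical and introduced only for illustration.

```latex
% Assumed parameterization (not taken from the note itself):
\[
  \mathbb{E}[A_{ij}] = \theta_i \, \theta_j \, B_{c_i c_j}.
\]
% Rescale each community by an arbitrary constant a_k > 0:
\[
  \tilde{\theta}_i = a_{c_i} \theta_i ,
  \qquad
  \tilde{B}_{kl} = \frac{B_{kl}}{a_k a_l}.
\]
% The expected adjacency matrix is unchanged:
\[
  \tilde{\theta}_i \, \tilde{\theta}_j \, \tilde{B}_{c_i c_j}
  = a_{c_i} \theta_i \; a_{c_j} \theta_j \; \frac{B_{c_i c_j}}{a_{c_i} a_{c_j}}
  = \theta_i \, \theta_j \, B_{c_i c_j}.
\]
```

Under this assumed form, the parameters can be pinned down only after a normalization is fixed within each community (for example, a constraint on the $\theta_i$), and relabeling the communities gives the permutation ambiguity.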
Variational Estimators of the Degree-corrected Latent Block Model for Bipartite Networks

Zhao, Yunpeng; Hao, Ning; Zhu, Ji (May 2024, Journal of Machine Learning Research)
Shen, Xiaotong (Ed.)
Full Text Available
Network Inference Using the Hub Model and Variants

https://doi.org/10.1080/01621459.2023.2183133

He, Zhibing; Zhao, Yunpeng; Bickel, Peter; Weko, Charles; Cheng, Dan; Wang, Jirui (April 2024, Journal of the American Statistical Association)

Full Text Available
Editorial: Mapping microbial diversity onto the phylogeny of associated plant species

https://doi.org/10.3389/fpls.2024.1421637

Xiang, Qiu-Yun Jenny; Kivlin, Stephanie N; Soltis, Douglas E; Yu, Shixiao; Chu, Haiyan; Soltis, Pamela S; Zhao, Yunpeng (May 2024, Frontiers in Plant Science)

Full Text Available
Biases in using social media data for public health surveillance: A scoping review

https://doi.org/10.1016/j.ijmedinf.2022.104804

Zhao, Yunpeng; He, Xing; Feng, Zheng; Bost, Sarah; Prosperi, Mattia; Wu, Yonghui; Guo, Yi; Bian, Jiang (August 2022, International Journal of Medical Informatics)

Full Text Available
Integrating Crowdsourcing and Active Learning for Classification of Work-Life Events from Tweets

https://doi.org/10.1007/978-3-030-55789-8_30

Zhao, Yunpeng; Prosperi, Mattia; Lyu, Tianchen; Guo, Yi; Zhou, Le; Bian, Jiang (January 2020, Trends in Artificial Intelligence Theory and Applications. Artificial Intelligence Practices)

Full Text Available
Detecting associations between dietary supplement intake and sentiments within mental disorder tweets

https://doi.org/10.1177/1460458219867231

Wang, Yefeng; Zhao, Yunpeng; Zhang, Jianqiu; Bian, Jiang; Zhang, Rui (September 2019, Health Informatics Journal)

Many patients with mental disorders take dietary supplements, but their use patterns remain unclear. In this study, we developed a method to detect signals of associations between dietary supplement intake and mental disorders in Twitter data. We developed an annotated dataset and trained a convolutional neural network classifier that can identify language use patterns of dietary supplement intake with an F1-score of 0.899, a precision of 0.900, and a recall of 0.900. Using the classifier, we discovered that melatonin and vitamin D were the most commonly used supplements among Twitter users who self-diagnosed mental disorders. Sentiment analysis using Linguistic Inquiry and Word Count showed that, among Twitter users who posted a mental disorder self-diagnosis, users who indicated supplement intake are more active and express more negative emotions and fewer positive emotions than those who have not mentioned supplement intake.
Full Text Available
Location Prediction with Communities in User Ego-Net in Social Media

https://doi.org/10.1109/ICC.2019.8761695

Wagenseller, Paul; Avram, Adrian; Jiang, Eric; Wang, Feng; Zhao, Yunpeng (May 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC))

Social media embed rich but noisy signals of the physical locations of their users. Accurately inferring a user's location can significantly improve the user's experience on social media and enable the development of new location-based applications. This paper proposes a novel community-based approach for predicting the location of a user by using communities in the ego-net of the user. We further propose both geographical proximity and structural proximity metrics to profile communities in the ego-net of a user, and then evaluate the effectiveness of each individual metric on real social media data. We discover that geographical proximity metrics, such as average/median haversine distance and community closeness, are strong indicators of a good community for geotagging. In addition, the structural proximity metric conductance performs comparably to geographical proximity metrics, while triangle participation ratio and internal density are weak location indicators. To the best of our knowledge, this is the first effort to infer the physical location of a user from the perspective of latent communities in the user's ego-net.
Full Text Available
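
Two of the metrics named in the abstract above, haversine distance and conductance, have standard textbook definitions, sketched below for readers unfamiliar with them. This is a minimal illustration and not the authors' implementation: the function names, the edge-list representation, and the choice of mean Earth radius are all assumptions made here.

```python
import math

EARTH_RADIUS_KM = 6371.0088  # assumed mean Earth radius, in kilometres


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in km between two (latitude, longitude) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def conductance(edges, community):
    """Conductance of a node set in an undirected graph given as a list of (u, v) edges.

    Defined here as cut(S, rest) / min(vol(S), vol(rest)), where vol is the
    sum of node degrees. Lower values indicate a better-separated community.
    """
    members = set(community)
    cut = vol_in = vol_out = 0
    for u, v in edges:
        # Each edge endpoint contributes one unit of degree to its side.
        for node in (u, v):
            if node in members:
                vol_in += 1
            else:
                vol_out += 1
        # An edge is cut when exactly one endpoint lies inside the community.
        if (u in members) != (v in members):
            cut += 1
    denom = min(vol_in, vol_out)
    return cut / denom if denom else 0.0


# Hypothetical toy graph: two triangles joined by a single edge.
toy_edges = [("a", "b"), ("b", "c"), ("a", "c"),
             ("c", "d"),
             ("d", "e"), ("e", "f"), ("d", "f")]
toy_conductance = conductance(toy_edges, {"a", "b", "c"})
```

In the toy graph, one edge leaves the community {a, b, c} and each side has total degree 7, so the conductance is 1/7.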
